Fuzzy Imputation Method for Database Systems

نویسندگان

José Ignacio Peláez

Jesús M. Doña

David La Red

چکیده

The missing data and nonresponse problem is a usual difficulty of particular concern in medical and social science databases. Dealing with nonresponse can be a difficult matter and it is important to apply adequate missing data methods to obtain valid inference. Missing data is a very common problem in real data sets, and different methods to solve this problem have been developed. A simple and common strategy is to ignore missing values, thus reducing the size of the useful data set. The experience in databases has demonstrated the dangers of simply removing cases (listwise deletion) from the original data set, and deletion can introduce AbstrAct

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Missing Data Imputation: A Study of Fuzzy K-means Clustering Method

In this paper, we present a missing data imputation method based on one of the most popular techniques in Knowledge Discovery in Databases (KDD), i.e. clustering technique. We combine the clustering method with soft computing, which tends to be more tolerant of imprecision and uncertainty, and apply a fuzzy clustering algorithm to deal with incomplete data. Our experiments show that the fuzzy i...

متن کامل

Microsoft Word - ICAME09_opti_leslabay_final

There are many situations where input feature vectors are incomplete and methods to tackle the problem have been studied for a long time. A commonly used procedure is to replace each missing value with an imputation. This paper presents a method to perform categorical missing data imputation from numerical and categorical variables. The imputations are based on Simpson’s fuzzy min-max neural ne...

متن کامل

Microsoft Word - 5_.rtf

متن کامل

Microsoft Word - Pilar Rey-del-Castillo.rtf

متن کامل

On a Fuzzy c-means Algorithm for Mixed Incomplete Data Using Partial Distance and Imputation

The focus of fuzzy c-means clustering method is normally used on numerical data. However, most data existing in databases are both categorical and numerical. To date, clustering methods have been developed to analyze only complete data. Although we sometimes encounter data sets that contain one or more missing feature values (incomplete data), traditional clustering methods cannot be used for s...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Fuzzy Imputation Method for Database Systems

نویسندگان

چکیده

منابع مشابه

Towards Missing Data Imputation: A Study of Fuzzy K-means Clustering Method

Microsoft Word - ICAME09_opti_leslabay_final

Microsoft Word - 5_.rtf

Microsoft Word - Pilar Rey-del-Castillo.rtf

On a Fuzzy c-means Algorithm for Mixed Incomplete Data Using Partial Distance and Imputation

عنوان ژورنال:

اشتراک گذاری